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SUMMARY 


A cDNA clone representing the gene for the E3 glycoprotein, the haemagglutinin, of 
bovine coronavirus was isolated from a plasmid cDNA library of the viral genome and 
sequenced. The gene is located immediately 5’ of the E2 glycoprotein gene on the viral 
genome. Nucleotide sequencing of the E3 gene predicts a polypeptide of 424 amino 
acids with an M, of 47K. Jn vitro translation of mRNA transcribed from the cloned E3 
gene yielded a polypeptide of M, 45K, similar to that predicted from the nucleotide 
sequence. In the presence of microsomal membranes, the in vitro product was 
cotranslationally processed to a 62K polypeptide which comigrated on SDS- 
polyacrylamide gels with the E3 monomer (gp62) obtained from virus-infected cells. 
Both the 45K and 62K polypeptides were immunoprecipitated with E3-specific 
monoclonal antibodies, confirming the identity of the gene as that encoding the E3 
glycoprotein. Finally, only monoclonal antibodies to the E3 protein inhibited 
haemagglutination by the virus thus confirming its identity as the haemagglutinin of 
bovine coronavirus. 


INTRODUCTION 


Bovine coronavirus (BCV) is a member of the family Coronaviridae, a group of enveloped, 
positive-stranded RNA viruses which infect human and animal hosts, causing a number of 
diverse diseases (Siddell et a/., 1983). In neonatal calves, BCV causes an acute and often fatal 
enteritis characterized by intestinal villous atrophy and diarrhoea (Mebus, 1978). 

Coronaviruses multiply in the cytoplasm of infected cells and produce genome size (20 to 
27 kb) and multiple subgenomic mRNAs which form a 3’ coterminal nested set (Siddell et al., 
1983; Jacobs et al., 1986; de Groot et al., 1987; Boursnell et al., 1987; Keck et al., 1988; Stern & 
Kennedy, 1980). In general, only the ‘unique’ 5’ end of each mR NA (sequences not contained in 
the smaller mRNAs) are translated. 

The virions of most coronaviruses are composed of three structural proteins designated N, the 
nucleoprotein, El, a small matrix glycoprotein, and E2, the glycoprotein which forms the large 
characteristic club-shaped peplomers (Siddell et a/., 1983). BCV and the other haemagglutinat- 
ing mammalian coronaviruses, human respiratory coronavirus OC-43 and haemagglutinating 
encephalomyelitis virus of swine, contain a third glycoprotein designated E3 (Pocock & Garwes, 
1977; Storz et al., 1981; King & Brian, 1982; Hogue et a/., 1984; Hogue & Brian, 1986; Deregt et 
al., 1987). The BCV E3 protein is a disulphide-linked dimer of 124K to 140K, reducible to 62K 
to 65K subunits. Recently we have identified the precursor to this protein in pulse-chase 
experiments as a 59K N-glycosylated polypeptide (monomer) (Deregt et al., 1987). This protein 
rapidly dimerizes by intermolecular disulphide bridging before further glycosylation yields the 
mature form of the protein (gp124). These results indicated that the E3 protein was probably a 
dimer made of identical subunits. The apoprotein (monomer) has an M, of 42:5K as determined 
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by studies utilizing tunicamycin, an inhibitor of N-linked glycosylation (Hogue & Brian, 1987). 
Thus, the sugars account for about 35% of the M, of the protein. 

Although a function for the E3 protein has not yet been determined, two important properties 
of this protein are known. First, E3 has been identified as the BCV haemagglutinin (King et al., 
1985). In this report we show that only monoclonal antibodies to the E3 protein inhibit virus 
haemagglutination thus confirming its identity as the haemagglutinin. Second, the E3 protein, 
in addition to the BCV E2 protein, can elicit the production of both in vitro and in vivo 
neutralizing antibodies (Deregt & Babiuk, 1987; D. Deregt et al., unpublished data). Therefore, 
the available evidence indicates that this protein has an important role in BCV-cell interactions. 

In this report we present the sequence of the unique E3 gene of BCV. The nucleotide sequence 
predicts a polypeptide with features characteristic of a membrane glycoprotein: a hydrophobic 
amino-terminal signal sequence and a carboxy-terminal membrane-anchoring domain. The 
cloned gene has also been utilized to direct the in vitro synthesis and glycosylation of a 
polypeptide which is recognized by E3-specific monoclonal antibodies. The results indicate that 
the E3 protein in its mature form is a homodimer. 


METHODS 


Molecular cloning of the BCV E3 gene. The purification of the Quebec strain of BCV (Dea et al., 1980) by 
polyethylene glycol precipitation and gradient centrifugation was performed as described previously (Deregt & 
Babiuk, 1987). Genomic RNA was prepared by phenol :chloroform : isoamy] alcohol (50:48 :2) extraction and 
ethanol precipitation. Synthesis of double-stranded cDNA was carried out essentially as described (Gubler & 
Hoffman, 1983; Maniatis et al., 1982). After addition of BamHI linkers and digestion with BamHI restriction 
endonuclease, cDNA of 3000 bp and greater was purified by agarose gel electrophoresis, ligated into BamHI- 
cleaved pTZ19R (Pharmacia) and used to transform Escherichia coli strain DH1. Recombinants were identified by 
in situ hybridization (Grunstein & Hogness, 1975) using a 32P-labelled cDNA probe prepared by randomly primed 
reverse transcription of viral genomic RNA. 

DNA sequencing. Sequencing was by the dideoxy termination method (Sanger et al., 1977) after generation of a 
series of overlapping deletions (Henikoff, 1984) and transformation of E. coli JM105. Sequence analysis was 
carried out using the DNA/protein analysis system of International Biotechnologies. 

Northern blotting and hybridization. MDBK cells, grown in MEM supplemented with 10% foetal bovine serum, 
_ were infected with BCV at a multiplicity of 1 p.f.u./cell. After 24 h at 37 °C, the monolayers were scraped into 
phosphate-buffered saline and pelleted at 1000 g for 5 min. The cells were lysed by vortexing in a minimal volume 
of 50 mm-Tris-HCI pH 8-0, 100 mm-KCl, 5 mm-EDTA, 1% NP40 and 0-5% sodium deoxycholate. Insoluble 
material was removed by centrifugation for | min at 12000 g and the supernatant was phenol-extracted and 
ethanol-precipitated. Polyadenylated RNA was prepared by oligo(dT)-cellulose chromatography (Aviv & Leder, 
1972). The RNA was electrophoresed on 1 % formaldehyde-agarose gels (Lehrach et al., 1977) and electroblotted 
onto Zeta-probe membranes (Bio-Rad). Radiolabelled probes were prepared by random priming DNA synthesis 
with the Klenow fragment of DNA polymerase I using denatured virus-specific restriction fragments as template. 
Hybridization was carried out at 42 °C for 24 h in 50% formamide, 5 x SSC (1 x SSC is 0-15 M-NaCl, 0-015 M- 
sodium citrate), 5x Denhardt’s solution and 150 pg/ml yeast tRNA. The blots were washed for { h at 60°C in 
0-1 x SSC and 0-1% SDS and radiographed. 

In vitro transcription and translation of the BCV E3 gene. A cassette containing nucleotides 53 to 1350 of the 
sequence in Fig. 3 was prepared by controlled exonuclease IIJ (Henikoff, 1984) digestion of the 5’ BamHI-Bgill 
fragment of clone pCVA12H (Fig. 1). After addition of BamHI linkers, the cassette was inserted into the BamHI 
site of pTZ19R and the nucleotide sequences of the termini of the gene cassette were determined. m’GpppA- 
capped mRNA was produced by the method of Melton et al. (1984) with T7 RNA polymerase after cleavage of the 
plasmid at the Acc] site in the polylinker. The RNA transcripts were translated in rabbit reticulocyte lysates 
containing 500 1Ci/ml [35S]methionine (Amersham; >800 Ci/mmol). The products were immunoprecipitated 
with a pool of three monoclonal antibodies (BD-98C, KC4-3, KD9-40) specific for the BCV E3 glycoprotein 
(Deregt & Babiuk, 1987) and analysed by electrophoresis on 13% acrylamide : N,N’-diallyltartardiamide (30 :1-4) 
gels using the buffer system of Laemmli (1970). 

Haemagglutination and haemagglutination inhibition assay. Haemagglutination of erythrocytes from BALB/c 
mice by BCV was performed as described previously (King & Brian, 1982). For haemagglutination inhibition 
testing, 20 haemagglutinating units of virus were mixed with an equal volume of twofold serial dilutions of each 
monoclonal antibody. After 1 h at room temperature, an equal volume of 0-4% (v/v) erythrocytes in phosphate- 
buffered saline was added and held at room temperature for 45 min. Haemagglutination inhibition titres are 
expressed as the highest dilution of monoclonal antibody at which inhibition of haemagglutination was complete. 
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Fig. 1. Organization of the 3’-terminal 8500 nucleotides of the BCV genome. The positions of the N and 
El genes are indicated. A partial restriction map and relative location of representative cDNA clones 
are indicated. The stippled region of viral genome indicates the sequence shown in Fig. 3. 


RESULTS 
Cloning and sequencing of the BCV E3 gene 


Restriction mapping and nucleotide sequencing of a series of large overlapping cDNA clones 
yielded a linear map representing approximately 10000 nucleotides of the BCV genome (Fig. 1). 
In order to determine the proximity of these sequences to the 3’ end of the virion genome, a 
Northern blot of poly(A)+ RNA from BCV-infected MDBK cells was probed with a 300 bp PstI 
fragment from the 3’ end of clone pCVA12 (Fig. 1; solid boxed area). Eight species of RNA were 
detected with this probe (Fig. 2, lane 1), the largest of which was greater than 25 kb in length and 
apparently the genomic equivalent (Fig. 2, compare lanes | and 2). These results indicated that 
the linear map shown in Fig. | extends to nearly the 3’ end of the viral genome given the nested 
nature of the eight intracellular BCV RNAs (Keck et al., 1988). 

Several additional RNAs were occasionally detected (Fig. 2, lane 1) but their origin is not 
known. They were not likely to be due to the presence of defective interfering particles as 
analysis of the virion RNA (Fig. 2, lane 2) indicated that a single species of RNA was present in 
virus used for infection of cells. 

Based upon alignment of the genomic RNA and the restriction map shown in Fig. 1, it was 
predicted that clones pCVA12H and pCVM would contain cDNA sequences representing the 
unique sequences of mRNAs 3 and 4. RNA 3, previously designated RNA 2a, has been 
suggested to code for the BCV E3 haemagglutinin protein (Keck et al., 1988) because it is absent 
from cells infected with murine hepatitis virus (MHV), a closely related coronavirus which lacks 
a protein analogous to the BCV E3 protein. As predicted, pCVA12H hybridized only to BCV 
mRNAs | to 4 when used to probe RNA from infected cells (Fig. 2, lane 3). The two bands at the 
positions of the 28S and 18S rRNAs are artefacts due to the large amounts of RNA loaded onto 
the gel. 

The cDNA inserts in pCVA12H and pCVM were sequenced completely in both directions. 
On the basis of a comparison with the sequence of the MHV-JHM E2 gene (Schmidt et ail., 
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Fig. 2. Northern blot of bovine coronavirus RNA. RNA 
was electrophoresed on 1°% agarose-formaldehyde gels 

* and electroblotted onto nylon membranes. Lane 1, 

aL poly(A)+ RNA from BCV-infected MDBK cells probed 
with the 3’ Pst] fragment of clone pCVA12; lane 2, 
purified virion RNA probed as in lane 1; lane 3, total 
intracellular RNA from BCV-infected cells probed with 
clone pCVAI2H. The location of 28S and 18S rRNA 
markers is indicated. 


1987), the sequences at the 3’ end of pCVA12H were found to represent most of the sequence of 
the BCV E2 gene (M. D. Parker, unpublished data), thus confirming that mRNA 4 encodes the 
BCV E2 precursor polypeptide. Extending upstream of the E2 gene, in the position predicted to 
contain the unique sequences of mRNA 3, were an additional 1350 nucleotides, which 
contained a large open reading frame (ORF) of 1272 nucleotides. This ORF terminated 14 
nucleotides upstream from the E2 gene and encoded a polypeptide of 424 amino acids with an 
M, of 47-6K (Fig. 3). 

Beginning 16 nucleotides upstream of the proposed initiation codon at nucleotides 62 to 64 for 
the E3 polypeptide is the sequence ACTAAAC, which is similar to a conserved intergenic 


Fig. 3. Nucleotide sequence of the bovine coronavirus E3 gene. The amino acid sequence of the 
predicted polypeptide products of the E3 gene and the IORFs IORF1 and IORF2? are also shown. 
Asterisks indicate conserved intergenic sequences. Underlined Asn residues indicate possible sites for 
addition of N-linked oligosaccharides in the E3 polypeptide product. 
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LWTAATA 
TAGATTGTAATTGTTTGGGGTTTTATGAATCTCCAGTTGAAGAAGACTAAACTCAGTGAAAATGTTTTTGCTTCTTAGATTTGTTCTAGTT AGC - 94 
MetPheLeuLeuLeuArgPheValLeuVal Ser 


TGCATAATTGGTAGCCTAGGTTTTGATAACCCTCCTACCAATGTTGTTTCGCATTTAAATGGAGATTGGTTTTTATTTGGTGACAGTCGTTCA- 187 
CysI lel leGlySerLeuGl yPheAspAsnPr oProThr AsnVa] Val SerHisLeuAsnGl yAspTr pPheLeuPheGl yAspSer ArgSer 
LORF1 MetGlulleGlyPheTyrLeuValThrValValGl 


GATTGTAATCATGTTGTTAATACCAACCCCCGTAATTATTCTTATATGGACCTTAATCCTGCCCTGTGTGATTCTGGTAAAATATCATCTAAA- 260 
AspCysAsnHisValValAsnThrAsnProArgAsnTyrSerTyrMet AspLeuAsnPr oAlaLeuCysAspSerGl yLysI leSerSerLys 
nl leVall leMetLeuLeu! JeProThrProValllel leLeu!leTrpThrLeu! teLeuProCysValI leLeuValLysTyrHisLeuLy 


GCTGGCAACTCCATTTTTAGGAGTTTTCACTTTACCGATTTTTATAATTACACAGGCGAAGGTCAACAAATTATTTTTTATGAGGGTCTT AAT - 373 
AlaGlyAsnSer I lePheArqSerPheHisPheThrAspPheTyrAsnTyrThrGl yGluGlyGInGinI le! lePheTyrGluGlyLeuAsn 
sLeuAlaThrProPheLeuGl yValPheThrLeuProl lePhe! le! leThrGlnAlaLysValAsnLysLeuPhePheMet ArqValLeul] 


TITACGCCTTATCATGCCTTTAAATGCACCACTTCTGGTAGT AATGATATTTGGATGCAGAATAAAGGCTTGTTTTACACTCAGGTTTATAAG- 466 
PheThrProTyrHisAlaPheLysCysThr Thr SerGl ySerAsnAsp! leTr pMetG] nAsnLysGl yLeuPheTyr ThrG1nVal TyrLys 
eLeuArgLeul leMetProLeuAsnAlaProLeuLeuValValMet I lePheGlyCysArgl leLysAlaCysPheThrLeuArgPhel lear 


AATATGGCTGTGTATCGCAGCCTTACTTTIGTTAATGTACCATATGTTTATAATGGCTCTGCACAATCTACAGCTCTTTGTAAATCTGGTAGT- 559 
AsnMetAlaValTyrArgSerLeuThrPheValAsnValProTyrValTyrAsnGlySerAlaGlnSerThrAlaLeuCysLysSerGlySer 
glleTrpLeuCys! LeAlaAlaLeuLeuLeuLeuMetTyrHisMetPhel LeMetAlaLeuHisAsnLeuGlnLeuPheVal AsnLeuVal Va 


TTAGTICTAAATAACCCTGCATATATAGCTCGTGAAGCTAATTTTGGGGATTATTATT ATAAGGTTGAAGCTGACTITTATITGTCAGGTTGT-652 
LeuValLeuAsnAsnProAlaTyrl leAlaArgGluAlaAsnPheGlyAspTyr Tyr TyrLysValGluALaAspPheTyrLeuSerGl yCys 
l--- 


GACGAGTATATCGTACCACTTTGTATTT TT AACGGCAAGTTTTTGTCGAATACAAAGTATTATGATGATAGTCAATATTATTTTAATAAAGAC- 745 
AspGluTyrI leValProLeuCys I lePheAsnG) yLysPheLeuSerAsnThrLysTyr Tyr AspAspSerGlaTyr TyrPheAsnLysAsp 


ACTGGTGTTATTTATGGTCTCAATTCTACTGAAACCATTACCACTGGTITIGATTTTAATIGTCATTATTTAGTTTTACCCTCTGGTAATTAT- 838 
ThrGlyVal!leTyrGlyLeuAsnSerThrGluThr I leThr ThrGl yPheaspPheAsnCysHisTyrLeuValLeuProserGlyAsnTyr 


TTAGCCATTTCAAATGAGCTATTGTTAACTGTTCCTACGAAAGCAATCTGTCTTAACAAGCGTAAGGATTTTACGCCTGTACAGGTTGTTGAT-931 
LeuAlal leSerAsnG)uLeuLeuLeuThr Va lProThrLysAlal leCysLeuAsnLysArgLys/.spPheThrProvalGlnVal¥alAsp 


TCACGGTGGAACAATGCCAGGCAGTCTGAT AACATGACGGCGGTTGCTTGTCAACCCCCGTACTGTTATTTTCGTAATTCTACTACCAACTAT-1024 
SerArgTrpAsnAsnAlaArgGlnSerAspAsnMetThrAlaVa 1 AlaCysGlnProProTyrCysTyrPheargAsnSerthrThrasntyr 
IORF2 He 


GTTGGTGTTTATGATATCAATCATGGGGATGCTGGTITTACTAGCATACTCAGTGCTTTGTTATATGATTCACCTTCTTTTTCGCAGCAAGGT-1117 
ValGlyValTyrAspI leAsnHisGlyAspAlaGlyPheThr Ser I leLeuSerGl yLeuLeuTyrAspSerProCysPheSerGlnG)nGly 
tLeuValPheHet I leSer{ leMetGlyMetLeuValLeuLeuAlatyrSerValValCysTyrMet I leHisLeuValPheArgSerLysVa 


GTTTTTAGGTATGATAATGTTAGCAGTGTCTGGCCTCTCTATTCCTATGGCAGATGCCCTACTGCTGCTGATATTAATACCCCTGATGTACCT-1210 
ValPheArgTyrAspAsnVa Ser SerValTrpProLeuTyrSerTyrGl yArgCysProthrAlaAlaAsp! leAsnThrProAspValPro 
1PheLeuGlyHet I leMetLewAlaValSerGl yLeuSer I leProMetAlaAspAlaLeuLeuLeuLeul leLeul leProLeuMet TyrLe 


ATTTGTGTGTATGATCCGCTACCACTTATTTTGCTTGGCATCCTITTGGCTGTTGCGGTCATAATTATTGTAGTITIGTICTTATATTITATG-1 303 
HeCysValTyrAspProLeuProLeu! leLeuLeuGly! leLeuLeuGl yValAlaVal! leI lel leValValLeuLeuLeuTyrPhexet 
uPheValCysMet I leArgTyrHisLeuPheCysLeuAlaSerPheTrpVal LeuArgSer --- 


RURAL 
GTGGATAATGGTACTAGGCTGCATGATGCTTAGACCATAATCTAAAC - 1350 
ValAspAsnGlyThrArgLeuHisAspAla--- 
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Fig. 4. Hydropathy plot of the predicted polypeptide of the E3 gene. The hydropathicity is calculated 
with a window of four amino acids. Positive values indicate hydrophobic regions and negative values 
indicate hydrophilic regions. 


sequence upstream of the BCV N and El genes (Lapps et al., 1987). A similar sequence, 
TCTAAAC, is also found beginning eight nucleotides downstream from the termination codon 
and immediately precedes the BCV E2 gene (M. D. Parker, unpublished data). The lack of other 
upstream and in-frame ATG codons further supports the idea that this is the initiation codon for 
the E3 polypeptide. Hydropathic analysis of the predicted polypeptide product indicated that 
the product had the characteristics of a membrane protein (Fig. 4). Immediately following the 
presumed initiation codon is a hydrophobic stretch of 15 amino acids which may function as a 
signal sequence for translocation of the polypeptide across the membranes of the rough 
endoplasmic reticulum. Another stretch of extreme hydrophobicity extends from amino acid 
389 to 414 and may serve to anchor the polypeptide in the viral envelope. Beyond the second 
hydrophobic domain, the 10 carboxy-terminal amino acids form a more hydrophilic domain 
which may extend into the interior of the virion. In addition, there are nine possible sites (Asn— 
X-Ser/Thr) for the addition of N-linked oligosaccharides. 

Within the large ORF extending from nucleotides 62 to 1334 are two additional shorter ORFs, 
ORF! and ORF2, both potentially encoding extremely hydrophobic polypeptides. The first of 
the internal reading frames (IORF) begins at nucleotide 153 and terminates at nucleotide 561). 
The second begins at position 1023 and extends to 1272. 


Identification of the gene product of mRNA 3 as the E3 polypeptide 


In order to identify the product of the long ORF shown in Fig. 3, m’GpppA-capped mRNA 
was produced in vitro and translated in a rabbit reticulocyte lysate. Immunoprecipitation of the 
product with a pool of E3-specific monoclonal antibodies yielded a polypeptide of 
approximately 45K (Fig. 5, lane 1). The presence of pancreatic microsomes during in vitro 
translation resulted in quantitative conversion of the 45K polypeptide to a product of 
approximately 62K (Fig. 5, lane 2), indicative of extensive post-translational processing, 
presumably the addition of oligosaccharides. The in vitro processed product migrated on SDS— 
PAGE gels to the same position as the E3 monomer from BCV-infected MDBK cells (Fig. 5, 
lane 3). The E3 monomer obtained from virus-infected cells appeared as a more heterogeneous 
band presumably due to additional post-translational processing of the oligosaccharide chains. 
These results demonstrate that the large ORF shown in Fig. 3 represents the E3 gene of bovine 
coronavirus. 


Confirmation that the E3 protein is the BCV haemagglutinin 


Previously, King et al. (1985) had identified the E3 protein (gp140/gp65) as the 
haemagglutinin based on the observation that bromelain treatment of BCV neither destroyed 
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Fig. 5. Jn vitro synthesis of the product of the largest 
ORF of the BCV E3 gene. Jn vitro transcripts represent- 
ing nucleotides 53 to 1350 of the cloned gene were 
translated in rabbit reticulocyte extracts and immuno- 
precipitated with pooled monoclonal antibodies. Lane 1, 
in vitro translation without added pancreatic micro- 
somes; lane 2, in vitro translation in the presence of 
pancreatic microsomes; lane 3, E3 polypeptide immuno- 
precipitated from BCV-infected MDBK cells; lane 4, as 
in lane 1, immunoprecipitated with normal bovine 
serum; lane 5, immunoprecipitation of BCV-infected 
MDBK cell extracts with normal bovine serum. 
M, x 10-3 indicated at the left. 


Table 1. Haemagglutination inhibition by monoclonal antibodies specific for bovine coronavirus 


Specificity/ Haemagglutination 
Monoclonal antigenic Neutralizing inhibition 

antibody group titre titre 
HC10-5 E3/Al 150000 2560 
KD9-40 E3/A2 50000 2560 
KC4-3 E3/B 1100 <20 
BD9-8C E3/C 1250 40 
HF8-8 E2/A 130000 <60 
HB10-4 E2/A 12500 <40 
BB7-14 E2/B 125000 <40 
Foetal bovine serum <20 


the integrity of the E3 protein nor eliminated the haemagglutinating activity of the virus 
whereas Pronase had the opposite effects. Consistent with this assignment, it was also pointed 
out that the non-haemagglutinating MHV A59 did not contain an analogous protein. However, 
serological evidence for this assignment has been lacking. 
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To determine whether any of our monoclonal antibodies with specificity for the BCV E2 or E3 
proteins (Deregt & Babiuk, 1987) could inhibit haemagglutination of BALB/c erythrocytes by 
BCV, virus was incubated separately with these antibodies in a haemagglutination inhibition 
assay. The results show that only E3-specific monoclonal antibodies, HC10-5 and KD9-40 could 
significantly inhibit haemagglutination by BCV, thus confirming that the E3 protein is the BCV 
haemagglutinin (Table 1). 


DISCUSSION 


In this report, evidence has been presented that the sequence shown in Fig. 3 is that of the E3 
glycoprotein monomer. First, the nucleotide sequence of the BCV E3 gene predicts a product 
with the typical hallmarks of a membrane protein; the presence of a putative hydrophobic 
amino-terminal signal sequence and a second hydrophobic region at the carboxy terminus which 
probably serves as a membrane anchoring domain (Garoff, 1985). Second, in the absence of 
microsomal membranes, in vitro translation of mRNA produced from the cloned gene yielded a 
45K polypeptide. This is similar in size to that predicted from the deduced amino acid sequence 
(47K) and also similar to the M, (42-5K) determined for the E3 apoprotein by studies utilizing 
tunicamycin, an inhibitor of N-linked glycosylation (Hogue & Brian, 1987). Third, when 
translated in the presence of microsomal membranes, MRNA produced from the cloned gene 
directed the synthesis of a 62K polypeptide. Processing of the 45K polypeptide to the 62K 
polypeptide probably involved addition of N-linked sugars. There are nine potential N-linked 
glycosylation sites in the sequence shown in Fig. 3, a sufficient number to accommodate the 
notion that this increase in M, is due solely to glycosylation (Klenk & Rott, 1980, also see below). 
However, other forms of post-translational modification such as fatty acylation may also be 
present. Fourth, the 62K polypeptide produced in the presence of microsomal membranes had 
the same M, as the E3 monomer (gp62) from BCV-infected cells. Finally, both 45K and 62K in 
vitro products obtained by translation of mRNA produced from the cloned gene could be 
immunoprecipitated by E3-specific monoclonal antibodies. 

The intracellular precursor to E3 is an N-glycosylated protein of 59K which is subsequently 
processed to the mature 62K form (Deregt et al., 1987). With an average M, of 2000 for N-linked 
chains of the high mannose type and 3000 for those of the complex type, we estimate that each 
59K monomer would contain six or seven N-linked chains of the high mannose type assuming 
that the sugars on the 59K precursor polypeptide are the result of core glycosylation (Klenk & 
Rott, 1980). Each mature E3 monomer would then have an estimated three or four N-linked 
chains of the complex type. The presence of high mannose and complex oligosaccharides on the 
E3 monomer may explain the apparent heterogeneity of the E3 monomer immunoprecipitated 
from infected cells. Thus, seven of the nine potential glycosylation sites are predicted to be 
utilized for the addition of carbohydrates to the BCV E3 polypeptide. 

From pulse-chase experiments, it was concluded that the E3 protein (gp124) was a 
homodimer (Deregt et al., 1987). Hogue & Brian (1987) have also made this suggestion based 
upon their deglycosylation studies which gave no indication that two different subunits existed. 
From M, considerations and the sequence of the E3 gene in Fig. 3, it is evident that the mature 
E3 protein is a disulphide-linked dimer of identical units. Viral proteins of this type appear to be 
rare. Other examples include the neuraminidase of influenza viruses and the haemagglutinin— 
neuraminidase protein of paramyxoviruses (Lazdins et al., 1972; Varghese et al., 1983; 
Markwell & Fox, 1980). 

The E3 gene contains two IORFs in addition to the sequence encoding the E3 polypeptide. 
The existence of multiple ORFs in other coronavirus genes, including the N protein gene of 
BCV (Lapps et al., 1987) and non-structural polypeptide genes of both infectious bronchitis 
virus (Boursnell et al., 1985) and MHV (Budzilowicz et al., 1985) have been reported previously. 
We have been able to produce a polypeptide in vitro encoded by the IORF extending from 
nucleotides 153 to 561, but the 15K product was not bound by bovine convalescent serum 
antibodies (M. D. Parker, unpublished data). Neither of the two internal reading frames is 
preceded by the consensus (T/A)CTAAAC sequence characteristic of the intergenic regions of 
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BCV, proposed to be involved in transcriptional control of coronaviruses (Baric et al., 1987). 
Nor are the proposed initiation codons in favourable sequence contexts. Thus, the question as to 
whether either of these ORFs is expressed during infection remains unanswered. 

The identity of the E3 protein as the BCV haemagglutinin was confirmed in that only E3- 
specific monoclonal antibodies were effective in inhibiting virus haemagglutination. The two 
E3-specific monoclonal antibodies that exhibited haemagglutination inhibition both recognized 
different epitopes within the same antigenic domain (A). They also exhibited the highest 
neutralization titres of our panel of E3 monoclonal antibodies (Deregt & Babiuk, 1987). This 
suggests that the site on the E3 protein responsible for interaction with red blood cell receptors 
may be identical with or in close proximity to a site which is involved in BCV infectivity. 

The origin and function of the BCV haemagglutinin is presently unknown. The question 
arises as to what additional function is provided by the E3 glycoprotein in addition to those 
activities which are normally carried out by the E2 glycoprotein in coronaviruses which lack the 
E3 gene. The availability of both neutralizing monoclonal antibodies and the cloned genes for 
these proteins should aid in the identification of the role played by the E3 glycoprotein in the 
replication of BCV. 

A search of 4248 amino acid sequences in the protein sequence database of the National 
Biomedical Research Foundation indicated that the E3 polypeptide of BCV bears significant 
similarity to the HA1 of type C influenza viruses (Nakada et al., 1984; Pfeifer & Compans, 
1984). Homologous amino acid sequences begin immediately following the putative signal 
sequence of the E3 protein and extend to just before the putative carboxy-terminal anchoring 
domain. This evidence of a common ancestral protein for the BCV E3 protein and the 
haemagglutinin (HA1) subunit for influenza viruses will be discussed further elsewhere. 


This work was supported by grants from the Natural Sciences and Engineering Research Council of Canada 
and is published with permission of the Director of the Veterinary Infectious Disease Organization as article 
no. 72. 
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